
    Genetic architecture of sporadic frontotemporal dementia and overlap with Alzheimer's and Parkinson's diseases

    BACKGROUND: Clinical, pathological and genetic overlap between sporadic frontotemporal dementia (FTD), Alzheimer's disease (AD) and Parkinson's disease (PD) has been suggested; however, the relationship between these disorders is still not well understood. Here we evaluated genetic overlap between FTD, AD and PD to assess shared pathobiology and identify novel genetic variants associated with increased risk for FTD. METHODS: Summary statistics were obtained from the International FTD Genomics Consortium, International PD Genetics Consortium and International Genomics of AD Project (n>75 000 cases and controls). We used conjunction false discovery rate (FDR) to evaluate genetic pleiotropy and conditional FDR to identify novel FTD-associated SNPs. Relevant variants were further evaluated for expression quantitative trait loci. RESULTS: We observed SNPs within the HLA, MAPT and APOE regions jointly contributing to increased risk for FTD and AD or PD. By conditioning on polymorphisms associated with PD and AD, we found 11 loci associated with increased risk for FTD. Meta-analysis across two independent FTD cohorts revealed a genome-wide signal within the APOE region (rs6857, 3′ UTR of PVRL2, p=2.21×10−12), and a suggestive signal for rs1358071 within the MAPT region (intronic in CRHR1, p=4.91×10−7), with the effect allele tagging the H1 haplotype. Pleiotropic SNPs at the HLA and MAPT loci were associated with expression changes in cis genes, supporting involvement of intracellular vesicular trafficking, immune response and endo/lysosomal processes. CONCLUSIONS: Our findings demonstrate genetic pleiotropy in these neurodegenerative diseases and indicate that sporadic FTD is a polygenic disorder in which multiple pleiotropic loci with small effects contribute to increased disease risk.
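
    The conditional and conjunction FDR approach mentioned above can be illustrated with a minimal sketch of the usual empirical estimator (the conditional FDR of a SNP in the primary trait is its p-value divided by the empirical conditional CDF among SNPs passing the conditioning threshold). This is an illustrative sketch of the general form, not the consortium's actual pipeline; the p-value arrays are hypothetical.

```python
import numpy as np

def conditional_fdr(p_primary, p_secondary):
    """Empirical conditional FDR: cFDR(p1 | p2) ~ p1 / F_hat(p1 | p2), where F_hat
    is the empirical CDF of p1 restricted to SNPs whose conditioning p-value is at
    least as small as p2 (a sketch of the general estimator)."""
    p1, p2 = np.asarray(p_primary, float), np.asarray(p_secondary, float)
    cfdr = np.ones_like(p1)
    for i in range(len(p1)):
        strata = p1[p2 <= p2[i]]                  # SNPs at least as associated with the conditioning trait
        f_hat = max(np.mean(strata <= p1[i]), 1e-12)
        cfdr[i] = min(1.0, p1[i] / f_hat)
    return cfdr

def conjunction_fdr(p_a, p_b):
    """Conjunction FDR for pleiotropy: the maximum of the two reciprocal cFDRs."""
    return np.maximum(conditional_fdr(p_a, p_b), conditional_fdr(p_b, p_a))

# Hypothetical p-values for a handful of SNPs in the FTD and AD scans.
p_ftd = np.array([1e-6, 3e-4, 0.02, 0.40])
p_ad = np.array([5e-5, 1e-3, 0.30, 0.55])
print(conjunction_fdr(p_ftd, p_ad))
```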

    Free choice activates a decision circuit between frontal and parietal cortex

    We often face alternatives that we are free to choose between. Planning movements to select an alternative involves several areas in frontal and parietal cortex that are anatomically connected into long-range circuits. These areas must coordinate their activity to select a common movement goal, but how neural circuits make decisions remains poorly understood. Here we simultaneously record from the dorsal premotor area (PMd) in frontal cortex and the parietal reach region (PRR) in parietal cortex to investigate neural circuit mechanisms for decision making. We find that correlations in spike and local field potential (LFP) activity between these areas are greater when monkeys are freely making choices than when they are following instructions. We propose that a decision circuit featuring a sub-population of cells in frontal and parietal cortex may exchange information to coordinate activity between these areas. Cells participating in this decision circuit may influence movement choices by providing a common bias to the selection of movement goals
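
    As a toy illustration of the kind of inter-areal correlation analysis described above (not the authors' actual spike/LFP pipeline; all spike counts below are simulated), one can compare trial-by-trial correlations between a PMd unit and a PRR unit under the two conditions:

```python
import numpy as np

rng = np.random.default_rng(0)
n_trials = 200

def interareal_r(counts_pmd, counts_prr):
    """Pearson correlation of trial-by-trial spike counts between the two areas."""
    return np.corrcoef(counts_pmd, counts_prr)[0, 1]

# Simulated spike counts: a shared drive on free-choice trials induces correlation,
# while instructed trials are generated independently for the two areas.
shared = rng.poisson(5, n_trials)
pmd_free, prr_free = shared + rng.poisson(10, n_trials), shared + rng.poisson(10, n_trials)
pmd_instr, prr_instr = rng.poisson(15, n_trials), rng.poisson(15, n_trials)

print("free choice r:", round(interareal_r(pmd_free, prr_free), 2))
print("instructed  r:", round(interareal_r(pmd_instr, prr_instr), 2))
```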

    When Does Reward Maximization Lead to Matching Law?

    What kind of strategies subjects follow in various behavioral circumstances has been a central issue in decision making. In particular, which behavioral strategy, maximizing or matching, is more fundamental to an animal's decision behavior has been a matter of debate. Here, we prove that any algorithm that achieves the stationary condition for maximizing the average reward should lead to matching when it ignores the dependence of the expected outcome on the subject's past choices. We may term this strategy of partial reward maximization the “matching strategy”. Then, this strategy is applied to the case where the subject's decision system updates the information used for making a decision. Such information includes the subject's past actions or sensory stimuli, and the internal storage of this information is often called “state variables”. We demonstrate that the matching strategy provides an easy way to maximize reward when combined with the exploration of the state variables that correctly represent the information crucial for reward maximization. Our results reveal for the first time how a strategy that achieves matching behavior is beneficial to reward maximization, providing a novel insight into the relationship between maximizing and matching.
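
    In equations (a sketch of the standard formulation; the notation below is ours, not necessarily the paper's), Herrnstein's matching law and the “matching strategy” stationarity condition read:

```latex
% Matching law: choices C_i are allocated in proportion to the rewards R_i they earn,
% i.e. the return per choice is equalised across alternatives.
\[
  \frac{C_1}{C_1 + C_2} = \frac{R_1}{R_1 + R_2}
  \quad\Longleftrightarrow\quad
  \frac{R_1}{C_1} = \frac{R_2}{C_2}.
\]
% Matching strategy: maximise the average reward over the choice probability p,
%   \langle r \rangle(p) = p\,E[r \mid a=1] + (1-p)\,E[r \mid a=2],
% while ignoring the dependence of E[r \mid a] on past choices. The stationary
% condition then reduces to equal expected returns,
\[
  \frac{\partial \langle r \rangle}{\partial p}
  = E[r \mid a=1] - E[r \mid a=2] = 0,
\]
% which is exactly the matching condition.
```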

    Learning Priors for Bayesian Computations in the Nervous System

    Our nervous system continuously combines new information from our senses with information it has acquired throughout life. Numerous studies have found that human subjects manage this by integrating their observations with their previous experience (priors) in a way that is close to the statistical optimum. However, little is known about the way the nervous system acquires or learns priors. Here we present results from experiments where the underlying distribution of target locations in an estimation task was switched, manipulating the prior that subjects should use. Our experimental design allowed us to measure each subject's evolving prior during learning. We confirm that through extensive practice subjects learn the correct prior for the task. We found that subjects can rapidly learn the mean of a new prior, while the variance is learned more slowly and with a variable learning rate. In addition, we found that a Bayesian inference model could predict the time course of the observed learning while offering an intuitive explanation for the findings. The evidence suggests the nervous system continuously updates its priors to enable efficient behavior.
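
    A minimal sketch of the computation at issue, assuming Gaussian priors and Gaussian sensory noise (the delta-rule updates and learning rates below are illustrative assumptions, not the paper's model):

```python
import numpy as np

rng = np.random.default_rng(1)

true_mean, true_sd = 10.0, 2.0        # new target distribution after the switch (hypothetical)
sensory_sd = 1.5                      # assumed observation noise
prior_mean, prior_var = 0.0, 25.0     # learner's current estimate of the prior
lr_mean, lr_var = 0.2, 0.02           # the mean adapts faster than the variance, as reported above

for trial in range(200):
    target = rng.normal(true_mean, true_sd)
    obs = target + rng.normal(0.0, sensory_sd)

    # Bayes-optimal estimate: precision-weighted combination of prior and observation.
    w_obs = prior_var / (prior_var + sensory_sd**2)
    estimate = w_obs * obs + (1.0 - w_obs) * prior_mean   # the subject's response this trial

    # Delta-rule updates of the learned prior from what was observed this trial.
    prior_mean += lr_mean * (obs - prior_mean)
    prior_var += lr_var * ((obs - prior_mean) ** 2 - prior_var)

print(f"learned prior: mean={prior_mean:.2f}, sd={prior_var ** 0.5:.2f}")
```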

    Behavioural Correlate of Choice Confidence in a Discrete Trial Paradigm

    How animals make choices in a changing and often uncertain environment is a central theme in the behavioural sciences. There is a substantial literature on how animals make choices in various experimental paradigms, but less is known about the way they assess a choice after it has been made in terms of the expected outcome. Here, we used a discrete trial paradigm to characterise how the reward history shaped the behaviour on a trial-by-trial basis. Rats initiated each trial, which consisted of a choice between two drinking spouts that differed in their probability of delivering a sucrose solution. Critically, sucrose was delivered after a delay from the first lick at the spouts – this allowed us to characterise the behavioural profile during the window between the time of choice and its outcome. Rats' behaviour converged to the optimal choice, both during the acquisition phase and after the reversal of contingencies. We monitored the post-choice behaviour at a temporal precision of 1 millisecond; lick-response profiles revealed that rats spent more time at the spout with the higher reward probability and exhibited a sparser lick pattern. This was the case even when we examined only the unrewarded trials, where the outcome was identical. The differential licking profiles preceded the differential choice ratios and could thus predict the changes in choice behaviour.
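
    A small sketch of how such lick profiles could be summarised from raw lick timestamps (the timestamps and the two summary measures below are illustrative; the original analysis may differ):

```python
import numpy as np

def lick_profile(lick_times_ms):
    """Summarise one trial's licks during the choice-to-outcome delay:
    total time at the spout and mean inter-lick interval (sparser = larger interval)."""
    t = np.sort(np.asarray(lick_times_ms, dtype=float))
    time_at_spout = t[-1] - t[0] if len(t) > 1 else 0.0
    mean_ili = np.mean(np.diff(t)) if len(t) > 1 else np.nan
    return time_at_spout, mean_ili

# Hypothetical lick timestamps (ms from first lick) on one unrewarded trial per spout.
high_prob_trial = [0, 180, 410, 700, 1050, 1500, 2100]
low_prob_trial = [0, 130, 260, 400, 560]

print(lick_profile(high_prob_trial))   # longer at the spout, larger inter-lick interval
print(lick_profile(low_prob_trial))
```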

    Integration of Sensory and Reward Information during Perceptual Decision-Making in Lateral Intraparietal Cortex (LIP) of the Macaque Monkey

    Single neurons in cortical area LIP are known to carry information relevant to both sensory and value-based decisions that are reported by eye movements. It is not known, however, how sensory and value information are combined in LIP when individual decisions must be based on a combination of these variables. To investigate this issue, we conducted behavioral and electrophysiological experiments in rhesus monkeys during performance of a two-alternative, forced-choice discrimination of motion direction (sensory component). Monkeys reported each decision by making an eye movement to one of two visual targets associated with the two possible directions of motion. We introduced choice biases to the monkeys' decision process (value component) by randomly interleaving balanced reward conditions (equal reward value for the two choices) with unbalanced conditions (one alternative worth twice as much as the other). The monkeys' behavior, as well as that of most LIP neurons, reflected the influence of all relevant variables: the strength of the sensory information, the value of the target in the neuron's response field, and the value of the target outside the response field. Overall, detailed analysis and computer simulation reveal that our data are consistent with a two-stage drift diffusion model proposed by Diederich and Busemeyer [1] for the effect of payoffs in the context of sensory discrimination tasks. Initial processing of payoff information strongly influences the starting point for the accumulation of sensory evidence, while exerting little if any effect on the rate of accumulation of sensory evidence.
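
    A minimal simulation of the starting-point account, assuming a simple one-dimensional drift-diffusion process (the parameter values are illustrative and this is not the authors' fitted model):

```python
import numpy as np

rng = np.random.default_rng(2)

def ddm_trial(drift, start, bound=1.0, dt=0.001, noise=1.0):
    """One drift-diffusion decision: returns (+1/-1 choice, decision time in s).
    'start' shifts the starting point of evidence accumulation toward one bound."""
    x, t = start, 0.0
    while abs(x) < bound:
        x += drift * dt + noise * np.sqrt(dt) * rng.normal()
        t += dt
    return (1 if x > 0 else -1), t

# Two-stage idea: payoff information sets the starting point, while motion
# coherence sets the drift rate (left unchanged by payoff).
coherence_drift = 0.8                 # weak sensory evidence toward choice +1
payoff_bias = 0.3                     # shift toward the higher-valued target (+1)

choices = [ddm_trial(coherence_drift, payoff_bias)[0] for _ in range(1000)]
print("P(choose the high-value target):", np.mean(np.array(choices) == 1))
```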

    Robustness of Learning That Is Based on Covariance-Driven Synaptic Plasticity

    It is widely believed that learning is due, at least in part, to long-lasting modifications of the strengths of synapses in the brain. Theoretical studies have shown that a family of synaptic plasticity rules, in which synaptic changes are driven by covariance, is particularly useful for many forms of learning, including associative memory, gradient estimation, and operant conditioning. Covariance-based plasticity is inherently sensitive. Even a slight mistuning of the parameters of a covariance-based plasticity rule is likely to result in substantial changes in synaptic efficacies. Therefore, the biological relevance of covariance-based plasticity models is questionable. Here, we study the effects of mistuning parameters of the plasticity rule in a decision making model in which synaptic plasticity is driven by the covariance of reward and neural activity. An exact covariance plasticity rule yields Herrnstein's matching law. We show that although the effect of slight mistuning of the plasticity rule on the synaptic efficacies is large, the behavioral effect is small. Thus, matching behavior is robust to mistuning of the parameters of the covariance-based plasticity rule. Furthermore, the mistuned covariance rule results in undermatching, which is consistent with experimentally observed behavior. These results substantiate the hypothesis that approximate covariance-based synaptic plasticity underlies operant conditioning. However, we show that the mistuning of the mean subtraction makes behavior sensitive to the mistuning of the properties of the decision making network. Thus, there is a tradeoff between the robustness of matching behavior to changes in the plasticity rule and its robustness to changes in the properties of the decision making network
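
    A toy sketch of a reward-covariance-style update in a two-alternative choice model (the sigmoid read-out, learning rates, and the mistuning factor kappa are illustrative assumptions, not the paper's network):

```python
import numpy as np

rng = np.random.default_rng(3)

w = np.array([0.0, 0.0])              # synaptic efficacies driving the two actions
r_bar = 0.0                           # running estimate of the mean reward
eta, eta_r = 0.05, 0.05
kappa = 1.0                           # kappa < 1 mistunes the mean subtraction (undermatching)
reward_prob = np.array([0.7, 0.3])    # hypothetical payoff probabilities

choices, rewards = [], []
for trial in range(5000):
    p0 = 1.0 / (1.0 + np.exp(-(w[0] - w[1])))        # soft decision between the two actions
    a = 0 if rng.random() < p0 else 1
    r = float(rng.random() < reward_prob[a])

    # Covariance-like rule: change the chosen action's efficacy in proportion to the
    # deviation of the obtained reward from its (possibly mistuned) running mean.
    w[a] += eta * (r - kappa * r_bar)
    r_bar += eta_r * (r - r_bar)

    choices.append(a)
    rewards.append(r)

choices, rewards = np.array(choices), np.array(rewards)
frac_choice = np.mean(choices == 0)
frac_income = rewards[choices == 0].sum() / max(rewards.sum(), 1.0)
print(f"choice fraction (option 0): {frac_choice:.2f}  income fraction: {frac_income:.2f}")
```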

    A neural integrator model for planning and value-based decision making of a robotics assistant

    Modern manufacturing and assembly environments are characterized by a high variability in the build process, which challenges human–robot cooperation. To reduce the cognitive workload of the operator, the robot should not only be able to learn from experience but also to plan and decide autonomously. Here, we present an approach based on Dynamic Neural Fields that apply brain-like computations to endow a robot with these cognitive functions. A neural integrator is used to model the gradual accumulation of sensory and other evidence as time-varying persistent activity of neural populations. The decision to act is modeled by a competitive dynamics between neural populations linked to different motor behaviors, which receive the persistent activation pattern of the integrators as input. In the first experiment, a robot rapidly learns by observation the sequential order of object transfers between an assistant and an operator, so that it can subsequently substitute for the assistant in the joint task. The results show that the robot is able to proactively plan the series of handovers in the correct order. In the second experiment, a mobile robot searches at two different workbenches for a specific object to deliver it to an operator. The object may appear at the two locations within a certain time period with independent probabilities unknown to the robot. The trial-by-trial decision under uncertainty is biased by the accumulated evidence of past successes and choices. The choice behavior over a longer period reveals that the robot achieves a high search efficiency in stationary as well as dynamic environments. The work received financial support from FCT through the PhD fellowships PD/BD/128183/2016 and SFRH/BD/124912/2016, the project “Neurofield” (PTDC/MAT-APL/31393/2017) and the research centre CMAT within the project UID/MAT/00013/2013.
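
    A compact sketch of the integrator-plus-competition idea, using simple rate equations rather than full Dynamic Neural Field kernels (all parameters are illustrative assumptions, not the paper's model):

```python
import numpy as np

rng = np.random.default_rng(4)

dt, tau, theta = 0.01, 0.5, 1.0       # time step, decision time constant, decision threshold
w_exc, w_inh = 1.0, 2.0               # excitation from the integrators, mutual inhibition
u = np.zeros(2)                       # integrator populations: accumulated evidence per option
d = np.zeros(2)                       # decision populations competing via mutual inhibition

def relu(x):
    """Threshold-linear firing-rate function."""
    return np.maximum(x, 0.0)

evidence = np.array([0.6, 0.4])       # hypothetical momentary evidence rates for two options

for step in range(2000):
    # Neural integrator: near-perfect accumulation of noisy evidence (persistent activity).
    u += dt * (evidence + 0.1 * rng.normal(size=2))
    # Decision layer: driven by the integrators, with mutual inhibition between options.
    d += dt / tau * (-d + relu(w_exc * u - w_inh * relu(d[::-1])))
    if d.max() > theta:
        break

print(f"selected option {int(np.argmax(d))} after {(step + 1) * dt:.2f} s")
```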

    Modeling the Violation of Reward Maximization and Invariance in Reinforcement Schedules

    It is often assumed that animals and people adjust their behavior to maximize reward acquisition. In visually cued reinforcement schedules, monkeys make errors in trials that are not immediately rewarded, despite having to repeat error trials. Here we show that error rates are typically smaller in trials equally distant from reward but belonging to longer schedules (referred to as the “schedule length effect”). This violates the principles of reward maximization and invariance and cannot be predicted by the standard methods of Reinforcement Learning, such as the method of temporal differences. We develop a heuristic model that accounts for all of the properties of the behavior in the reinforcement schedule task but whose predictions do not differ from those of the standard temporal difference model in choice tasks. In the modification of temporal difference learning introduced here, the effect of schedule length emerges spontaneously from the sensitivity to the immediately preceding trial. We also introduce a policy for general Markov Decision Processes, where the decision made at each node is conditioned on the motivation to perform an instrumental action, and show that the applications of our model to the reinforcement schedule task and the choice task are special cases of this general theoretical framework. Within this framework, Reinforcement Learning can approach contextual learning with the mixture of empirical findings and principled assumptions that seem to coexist in the best descriptions of animal behavior. As examples, we discuss two phenomena observed in humans that often derive from the violation of the principle of invariance: “framing,” wherein equivalent options are treated differently depending on the context in which they are presented, and the “sunk cost” effect, the greater tendency to continue an endeavor once an investment in money, effort, or time has been made. The schedule length effect might be a manifestation of these phenomena in monkeys.
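
    For reference, a tabular TD(0) sketch over schedule states (the state coding and parameters are illustrative assumptions): because standard temporal difference values depend only on the distance to reward, the model below assigns the same value to equally distant trials regardless of schedule length, which is exactly why it cannot reproduce the schedule length effect on its own.

```python
import numpy as np

rng = np.random.default_rng(5)

# Tabular TD(0) over "trials remaining until reward" in a visually cued schedule.
gamma, alpha = 0.9, 0.1
schedule_lengths = [1, 2, 3, 4]
V = {s: 0.0 for s in range(1, max(schedule_lengths) + 1)}

for episode in range(10000):
    length = rng.choice(schedule_lengths)
    for remaining in range(length, 0, -1):
        reward = 1.0 if remaining == 1 else 0.0          # reward only on the last trial
        v_next = 0.0 if remaining == 1 else V[remaining - 1]
        V[remaining] += alpha * (reward + gamma * v_next - V[remaining])

# Learned values depend only on the distance to reward, not on the schedule length.
print({s: round(v, 3) for s, v in V.items()})
```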

    Spatiotemporal neural characterization of prediction error valence and surprise during reward learning in humans

    Reward learning depends on accurate reward associations with potential choices. These associations can be attained with reinforcement learning mechanisms using a reward prediction error (RPE) signal (the difference between actual and expected rewards) for updating future reward expectations. Despite an extensive body of literature on the influence of RPE on learning, little has been done to investigate the potentially separate contributions of RPE valence (positive or negative) and surprise (absolute degree of deviation from expectations). Here, we coupled single-trial electroencephalography with simultaneously acquired fMRI, during a probabilistic reversal-learning task, to offer evidence of temporally overlapping but largely distinct spatial representations of RPE valence and surprise. Electrophysiological variability in RPE valence correlated with activity in regions of the human reward network promoting approach or avoidance learning. Electrophysiological variability in RPE surprise correlated primarily with activity in regions of the human attentional network controlling the speed of learning. Crucially, despite the largely separate spatial extent of these representations, our EEG-informed fMRI approach uniquely revealed a linear superposition of the two RPE components in a smaller network encompassing visuo-mnemonic and reward areas. Activity in this network was further predictive of stimulus value updating, indicating a comparable contribution of both signals to reward learning.
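
    A minimal sketch of how RPE valence and surprise can be separated on single trials, assuming a simple Rescorla-Wagner learner on a probabilistic reversal task (the learning rate and contingencies are illustrative, not the paper's fitted model):

```python
import numpy as np

rng = np.random.default_rng(6)

alpha = 0.3                           # learning rate for value updating
value = 0.5                           # expected reward for the chosen stimulus
reward_prob = 0.8                     # hypothetical stimulus-reward contingency

rpes = []
for trial in range(200):
    if trial == 100:                  # contingency reversal halfway through the session
        reward_prob = 0.2
    reward = float(rng.random() < reward_prob)
    rpe = reward - value              # reward prediction error: actual minus expected
    valence = np.sign(rpe)            # signed component (positive vs negative RPE)
    surprise = abs(rpe)               # unsigned component (degree of deviation)
    value += alpha * rpe
    rpes.append((valence, surprise))

post_reversal = [s for _, s in rpes[100:110]]
print("mean surprise just after the reversal:", round(float(np.mean(post_reversal)), 2))
```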